On the Use of Augmented Hmm Models for Overcoming Time and Parameter Independence Assumptions in Asr
نویسندگان
چکیده
There is significant interest in developing new acoustic models for speech recognition that overcome traditional HMM restrictions. In this work, we propose to use Ngram based augmented HMMs. Two approaches are presented. The first one consists on overcoming the parameter independence assumption. This is achieved by modeling the dependence between the different acoustic parameters, using N-gram modeling. Then, the input signal is mapped to the new probability space. The second proposal tries to overcome the time independence assumption, by modeling temporal dependencies of each acoustic feature. Different configurations have been tested for connected digit and continuous speech recognition, results showing that adding long span information is beneficial for ASR performance.
منابع مشابه
How Neural Network Depth Compensates for HMM Conditional Independence Assumptions in DNN-HMM Acoustic Models
While DNN-HMM acoustic models have replaced GMMHMMs in the standard ASR pipeline due to performance improvements, one unrealistic assumption that remains in these models is the conditional independence assumption of the Hidden Markov Model (HMM). In this work, we explore the extent to which depth of neural networks helps compensate for these poor conditional independence assumptions. Using a bo...
متن کاملEnd-to-End Speech Recognition Models
For the past few decades, the bane of Automatic Speech Recognition (ASR) systems have been phonemes and Hidden Markov Models (HMMs). HMMs assume conditional independence between observations, and the reliance on explicit phonetic representations requires expensive handcrafted pronunciation dictionaries. Learning is often via detached proxy problems, and there especially exists a disconnect betw...
متن کاملWhat HMMs Can Do
Since their inception almost fifty years ago, hidden Markov models (HMMs) have have become the predominant methodology for automatic speech recognition (ASR) systems — today, most state-of-the-art speech systems are HMM-based. There have been a number of ways to explain HMMs and to list their capabilities, each of these ways having both advantages and disadvantages. In an effort to better under...
متن کاملA Study on the Use of Conditional Random Fields for Automatic Speech Recognition
Current state of the art systems for Automatic Speech Recognition (ASR) use statistical modeling techniques such as Hidden Markov Models (HMMs) and Gaussian Mixture Models (GMMs) to recognize spoken language. These techniques make use of statistics derived from the acoustic frequencies of the speech signal. In recent years, interest has been rising in the use of phonological features derived fr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008